• a I o ~ F , Enror3 Corroclflj: by llio STIC Systorr.3 sfaC^x.f ^/ 
ChWwgod a filo (torn non-ASCII lo ASCII Vor'l'fl^by-- ] 

□ Changed tho margins in cases whoro Iho sequence toxt was -wrapped* down lo the nexl lint 

□ Edited a lormal error in the Currenl Application Oala section. specifically: ^\t~J Q 

O Ediled the Currenl Application Data section with ihe actual current number The n.«nh-r • 

applicant was □ ihe prior application data; or □ other " mber ' npi 

□ Added (ho mandatory heading and subheadings lor "Current Application Data". 

n "iled tho -Numbor of Sequences" (fold. Tho applicant spelled out a number instead of'using 

(□ Changed Iho spelling ol a mandatory field (Ihe headings or subheadings)! specifically,: 



□ Corrected Iho SEQ ID NO when obviously incorrect. Tho sequence numbers that were edited 



□ Corroded subheading placement. All responses must bo on Iho same lino ^ ^ kk 

appl.cantp.aced a response below tho subheading, this was <^t?£££g£^ 

f~] Inserted colons alter hoadings/subhoaiiiJ!\|lo7d;jE 




Q De,olcd ex1 . ra - ''"^"d. headings used by an applicant, specifically: 




Delelod: S-rfon^SCII "garbage" al tho beginning/end ol lilos; O secrolarv mHi^filnn,™ 
□ page numbers throughout texi; □ other invalid lex... suc/i a7 * ""l-als/Wcnamo a 

Inserted mandatory headings, specifically: 
D Corrected an obvious error in Iho response, specifically: 



□ Edited idomifiers where upper case is used but lower case is required, or vice versa. 

□ Corrected an orror in Iho Number ol Sequences liold. specifically. 

□ A "Hard Pago freak" code was inserted by .he applicant. At. occurrences had lo bo d. e 7o.ed. 
CD Other: 



Examiner: J^bovo corrections mus. bo communicated to the applicant in (he firs. C 

Acuon. DO NOT send a copy of this form. 
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PCT09 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/831 , 426 



DATE: 11/19/2001 _ 
TIME: 14:12:38 RECEIVED 



4 

5 
6 
8 
10 
12 
13 
15 
17 
19 
20 
21 
22 
24 
25 
26 
28 



Input Set : A:\PTO.MH.txt 

Output Set: N:\CRF3\11192001\l831426.raw 

<110> APPLICANT: Hoechst Marion Roussel 
Bordon-Pallier, F. 
Rocher, C. 
TITLE OF INVENTION: 
FILE REFERENCE: 14 6. 
CURRENT APPLICATION 
CURRENT FILING DATE: 



NOV 2 0 2002 

TECH CENTER 1600/2900 



<120> 
<130> 
<140> 
<141> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<400> 



Human htFIIIA gene and coded htFIIIA protein 
1364 

NUMBER: US 09/831,426 
2001-05-08 



NUMBER OF SEQ ID NOS : 10 
SOFTWARE: Patentln Vers. 
SEQ ID NO: 1 
LENGTH: 1273 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 
NAME/KEY 
LOCATION 
SEQUENCE 



2.0 



ENTERED 



CDS 

(176) . . (1270) 
1 



29 atgcgcagca gcggcgccga cgcggggcgg tgcctggtga ccgcgcgcgc tcccggaagt 60 
31 gtgccggcgt cgcgcgaagg ttcagcaggg agccgtgggc cgggcgcgcc ggttcccggc 120 

33 acgtgtctcg gcacgtggca gcgcgcctgg ccctgggctt ggaggcgccg gcgcc ctg 178 

34 Met 

35 1 



37 


gat 


ccg 


ccg 


gec 


gtg 


gtc 


gec 


gag 


teg 


gtg 


teg 


tec 


ttg 


acc 


ate 


gee 


226 


38 


Asp 


Pro 


Pro 


Ala 


Val 


Val 


Ala 


Glu 


Ser 


Val 


Ser 


Ser 


Leu 


Thr 


He 


Ala 




39 








5 










10 










15 








41 


gac 


gcg 


ttc 


att 


gca 


gee 


ggc 


gag 


age 


tea 


get 


ccg 


acc 


ccg 


ccg 


cgc 


274 


42 


Asp 


Ala 


Phe 


He 


Ala 


Ala 


Gly 


Glu 


Ser 


Ser 


Ala 


Pro 


Thr 


Pro 


Pro 


Arg 




43 






20 










25 










30 










45 


ccc 


gcg 


ctt 


ccc 


agg 


agg 


ttc 


ate 


tgc 


tec 


ttc 


cct 


gac 


tgc 


age 


gee 


322 


46 


Pro 


Ala 


Leu 


Pro 


Arg 


Arg 


Phe 


He 


Cys 


Ser 


Phe 


Pro 


Asp 


Cys 


Ser 


Ala 




47 




35 










40 










45 












49 


aat 


tac 


age 


aaa 


gee 


tgg 


aag 


ctt 


gac 


gcg 


cac 


ctg 


tgc 


aag 


cac 


acg 


370 


50 


Asn 


Tyr 


Ser 


Lys 


Ala 


Trp 


Lys 


Leu 


Asp 


Ala 


His 


Leu 


Cys 


Lys 


His 


Thr 




51 


50 










55 










60 










65 




53 


ggg 


gag 


aga 


cca 


ttt 


gtt 


tgt 


gac 


tat 


gaa 


ggg 


tgt 


ggc 


aag 


gec 


ttc 


418 


54 


Gly 


Glu 


Arg 


Pro 


Phe 


Val 


Cys 


Asp 


Tyr 


Glu 


Gly 


Cys 


Gly 


Lys 


Ala 


Phe 




55 










70 










75 










80 






57 


ate 


agg 


gac 


tac 


cat 


ctg 


age 


cgc 


cac 


att 


ctg 


act 


cac 


aca 


gga 


gaa 


466 


58 


He 


Arg 


Asp 


Tyr 


His 


Leu 


Ser 


Arg 


His 


He 


Leu 


Thr 


His 


Thr 


Gly 


Glu 




59 








85 










90 










95 








61 


aag 


ccg 


ttt 


gtt 


tgt 


gca 


gee 


act 


ggc 


tgt 


gat 


caa 


aaa 


ttc 


aac 


aca 


514 


62 


Lys 


Pro 


Phe 


Val 


Cys 


Ala 


Ala 


Thr 


Gly 


Cys 


Asp 


Gin 


Lys 


Phe 


Asn 


Thr 




63 






100 










105 










110 










65 


aaa 


tea 


aac 


ttg 


aag 


aaa 


cat 


ttt 


gaa 


cgc 


aaa 


cat 


gaa 


aat 


caa 


caa 


562 


66 


Lys 


Ser 


Asn 


Leu 


Lys 


Lys 


His 


Phe 


Glu 


Arg 


Lys 


His 


Glu 


Asn 


Gin 


Gin 




67 




115 










120 










125 












69 


aaa 


caa 


tat 


ata 


tgc 


agt 


ttt 


gaa 


gac 


tgt 


aag 


aag 


acc 


ttt 


aag 


aaa 


610 
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RAW SEQUENCE LISTING 

PATENT APPLICATION : US/09/8 31 r 426 



DATE: 11/19/2001 
TIME: 14:12:38 



input Set : A:\PTO.MH.txt 

Output Set: N:\CRF3\11192001\l831426.raw 

70 Lys Gin Tyr lie Cys Ser Phe Glu Asp Cys Lys Lys Thr Phe Lys Lys 

71 130 135 140 145 

73 cat cag cag ctg aaa ate cat cag tgc cag cat acc aat gaa cct eta 

74 His Gin Gin Leu Lys lie His Gin Cys Gin His Thr Asn Glu Pro Leu 

75 150 155 160 

77 ttc aag tgt acc cag gaa gga tgt ggg aaa cac ttt gca tea ccc age 

78 Phe Lys Cys Thr Gin Glu Gly Cys Gly Lys His Phe Ala Ser Pro Ser 



79 



165 



170 



175 



91 210 



215 



220 



225 



93 ttt aaa cgc aaa gat tap ctt aag caa cac atg aaa act cat gec cca 

94 Phe Lys Arg Lys Asp Tyr Leu Lys Gin His Met Lys Thr His Ala Pro 

95 230 235 240 

97 gaa agg gat gta tgt cgc tgt cca aga gaa ggc tgt gga aga acc tat 

98 Glu Arg Asp Val Cys Arg Cys Pro Arg Glu Gly Cys Gly Arg Thr Tyr 

99 245 250 255 

101 act act gtg ttt aat etc caa age cat ate etc tec ttc cat gag gaa 

102 Thr Thr Val Phe Asn Leu Gin Ser His He Leu Ser Phe His Glu" Glu 

103 260 265 270 

105 age cgc cct ttt gtg tgt gaa cat get ggc tgt ggc aaa aca ttt gca 

106 Ser Arg Pro Phe Val Cys Glu His Ala Gly Cys Gly Lys Thr Phe Ala 

107 275 280 285 

109 atg aaa caa agt etc act agg cat get gtt gta cat gat cct gac aag 

110 Met Lys Gin Ser Leu Thr Arg His Ala Val Val His Asp Pro Asp Lys 

111 290 295 300 305 

113 aag aaa atg aag etc aaa gtc aaa aaa tct cgt gaa aaa egg agt ttg 

114 Lys Lys Met Lys Leu Lys Val Lys Lys Ser Arg Glu Lys Arg Ser Leu 

115 310 315 320 

117 gee tct cat etc agt gga tat ate cct ccc aaa agg aaa caa ggg caa 

118 Ala Ser His Leu Ser Gly Tyr He Pro Pro Lys Arg Lys Gin Gly Gin 

119 325 330 335 

121 ggc tta tct ttg tgt caa aac gga gag tea ccc aac tgt gtg gaa gac 

122 Gly Leu Ser Leu Cys Gin Asn Gly Glu Ser Pro Asn Cys Val Glu Asp 

123 340 345 350 

125 aag atg etc teg aca gtt gca gta ctt acc ctt ggc taa 

126 Lys Met Leii Ser Thr Val Ala Val Leu Thr Leu Gly 

127 355 360 365 

130 <210> SEQ ID NO: 2 

131 <211> LENGTH: 365 

132 <212> TYPE: PRT 

133 <213> ORGANISM: Human 
135 <400> SEQUENCE: 2. 



658 



706 



802 



81 aag ctg aaa cga cat gee aag gee cac gag ggc tat gta tgt caa aaa 754 

82 Lys Leu Lys Arg His Ala Lys Ala His Glu Gly Tyr Val Cys Gin Lys 

83 180 185 190 

85 gga tgt tec ttt gtg gca aaa aca tgg acg gaa ctt ctg aaa cat gtg 

86 Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His Val 

87 195 200 205 

89 aga gaa acc cat aaa gag gaa ata eta tgt gaa gta tgc egg aaa aca 

90 Arg Glu Thr His Lys Glu Glu He Leu Cys Glu Val Cys Arg Lys Thr 



850 



898 



946 



994 



1042 



1090 



1138 



1186 



1234 



1273 



file://C:\Crf3\Outhold\VsrI83 1 426.htm 



11/19/01 



RAW SEQUENCE LISTING DATE: 11/19/2001 

PATENT APPLICATION: US/09/831,426 TIME: 14:12:38 



Input Set : A:\PTO.MH.txt 

Output Set: N:\CRF3\11192001\I831426.raw 



136 


Met 


Asp 


Pro 


Pro 


Ala 


Val 


Val 


Ala 


Glu 


Ser 


Val 


Ser 


Ser 


Leu 


Thr 


He 


137 


1 








5 










10 










15 




139 


Ala 


Asp 


Ala 


Phe 


lie 


Ala 


Ala Gly 


Glu 


Ser 


Ser 


Ala 


Pro 


Thr 


Pro 


Pro 


140 








20 










25 










30 






142 


Ara 


Pro 


Ala 


Leu 


Pro 


Ara 


Arg 


Phe 


He 


Cys 


Ser 


Phe 


Pro 


Asp 


Cys 


Ser 


143 






35 










40 










45 








145 


Ala 


Asn 


Tvr 


Ser 


Lvs 


Ala 


Trp 


Lys 


Leu 


Asp 


Ala 


His 


Leu 


Cvs 


Lvs 


His 


146 




50 










55 










60 










148 


Thr 


Glv 


Glu 


Arcr 


Pro 


Phe 


Val 


Cys 


Asp 


Tvr 


Glu 


Gly 


Cvs 


Gly 


LVS 


Ala 


149 


65 










70 










75 










80 


151 


Phe 


lie 


Arg 


Asp 


Tvr 


His 


Leu 


Ser 


Ara 


His 


He 


Leu 


Thr 


His 


Thr 


Glv 


152 










85 










90 










95 




154 


Glu 


Lys 


Pro 


Phe 


Val 


Cys 


Ala 


Ala 


Thr 


Glv 


Cys 


Asp 


Gin 


Lys 


Phe 


Asn 


155 








100 










105 










110 






157 


Thr 


Lys 


Ser 


Asn 


Leu 


Lys 


Lys 


His 


Phe 


Glu 


Arcr 


Lys 


His 


Glu 


Asn 


Gin 


158 






115 










120 










125 








160 


Gin 


Lys 


Gin 


Tvr 


lie 


Cys 


Ser 


Phe 


Glu 


Asp 


Cys 


Lys 


Lys 


Thr 


Phe 


Lys 


161 




130 










135 










140 










163 


Lys 


His 


Gin 


Gin 


Leu 


Lys 


He 


His 


Gin 


Cys 


Gin 


His 


Thr 


Asn 


Glu 


Pro 


164 


145 










150 










155 










160 


166 


Leu 


Phe 


Lys 


Cvs 


Thr 


Gin 


Glu 


Gly 


Cvs 


Glv 


Lys 


His 


Phe 


Ala 


Ser 


Pro 


167 










165 










170 










175 




169 


Ser 


Lys 


Leu 


Lys 


Arg 


His 


Ala 


Lys 


Ala 


His 


Glu 


Glv 


Tvr 


Val 


Cys 


Gin 


170 








180 










185 










190 






172 


Lys 


Gly Cys 


Ser 


Phe 


Val 


Ala 


Lys 


Thr 


Trp 


Thr 


Glu 


Leu 


Leu 


Lvs 


His 


173 






195 










200 










205 








175 


Val 


Arg 


Glu 


Thr 


His 


Lys 


Glu 


Glu 


He 


Leu 


Cys 


Glu 


Val 


Cvs 


Ara 


Lvs 


176 




210 










215 










220 










178 


Thr 


Phe 


Lys 




Lys 


Asp 


Tvr 


Leu 


Lys 


Gin 


His 


Met 


Lys 


Thr 


His 


Ala 


179 


225 










230 










235 










240 


181 


Pro 


Glu 


Arg 


Asp 


Val 


Cys 


Arg 


Cys 


Pro 


Ara 


Glu 


Glv 


Cys 


Glv 

JT 


Arcr 


Thr 


182 










245 










250 










255 




184 


Tvr 

-Z 


Thr 


Thr 


Val 


Phe 


Asn 


Leu 


Gin 


Ser 


His 


lie 


Leu 


Ser 


Phe 


His 


Glu 


185 








260 










265 










270 






187 


Glu 


Ser 


Arg 


Pro 


Phe 


Val 


c y s 


Glu 


His 


Ala 


Gly 


Cys 


Gly 


Lys 


Thr 


Phe 


188 






275 










280 










285 








190 


Ala 


Met 


Lys 


Gin 


Ser 


Leu 


Thr 


Arg 


His 


Ala 


Val 


Val 


His 


Asp 


Pro 


Asp 


191 




290 










295 










300 










193 


Lys 


Lys 


Lys 


Met 


Lys 


Leu 


Lys 


Val 


Lys 


Lys 


Ser 


Arg 


Glu 


Lys 


Arg 


Ser 


194 


305 










310 










315 










320 


196 


Leu 


Ala 


Ser 


His 


Leu 


Ser 


Gly 


Tyr 


He 


Pro 


Pro 


Lys 


Arg 


Lys 


Gin 


Gly 


197 










325 










330 










335 




199 


Gin 


Gly 


Leu 


Ser 


Leu 


Cys 


Gin 


Asn 


Gly 


Glu 


Ser 


Pro 


Asn 


Cys 


Val 


Glu 


200 








340 










345 










350 






202 


Asp 


Lys 


Met 


Leu 


Ser 


Thr 


Val 


Ala 


Val 


Leu 


Thr 


Leu 


Gly 









203 355 360 365 

206 <210> SEQ ID NO: 3 

207 <211> LENGTH: 1273 

208 <212> TYPE: DNA 
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209 <213> ORGANISM: Human 

211 <400> SEQUENCE: 3 

212 atgcgcagca gcggcgccga cgcggggcgg tgcctggtga ccgcgcgcgc tcccggaagt 60 
214 gtgccggcgt cgcgcgaagg ttcagcaggg agccgtgggc cgggcgcgcc ggttcccggc 120 
216 acgtgtctcg gcacgtggca gcgcgcctgg ccctgggctt ggaggcgccg gcgccctgga 180 
218 tccgccggcc gtggtcgccg agtcggtgtc gtccttgacc atcgccgacg cgttcattgc 240 
220 agccggcgag agctcagctc cgaccccgcc gcgccccgcg cttcccagga ggttcatctg 300 
222 ctccttccct gactgcagcg ccaattacag caaagcctgg aagcttgacg cgcacctgtg 360 
224 caagcacacg ggggagagac catttgtttg tgactatgaa gggtgtggca aggccttcat 420 
226 cagggactac catctgagcc gccacattct gactcacaca ggagaaaagc cgtttgtttg 480 
228 tgcagccact ggctgtgatc aaaaattcaa cacaaaatca aacttgaaga aacattttga 540 
230 acgcaaacat gaaaatcaac aaaaacaata tatatgcagt tttgaagact gtaagaagac 600 
232 ctttaagaaa catcagcagc tgaaaatcca tcagtgccag cataccaatg aacctctatt 660 
234 caagtgtacc caggaaggat gtgggaaaca ctttgcatca cccagcaagc tgaaacgaca 720 
236 tgccaaggcc cacgagggct atgtatgtca aaaaggatgt tcctttgtgg caaaaacatg 780 
238 gacggaactt ctgaaacatg tgagagaaac ccataaagag gaaatactat gtgaagtatg 840 
240 ccggaaaaca tttaaacgca aagattacct taagcaacac atgaaaactc atgccccaga 900 
242 aagggatgta tgtcgctgtc caagagaagg ctgtggaaga acctatacta ctgtgtttaa 960 
244 tctccaaagc catatcctct ccttccatga ggaaagccgc ccttttgtgt gtgaacatgc 1020 
246 tggctgtggc aaaacatttg caatgaaaca aagtctcact aggcatgctg ttgtacatga 1080 
248 tcctgacaag aagaaaatga agctcaaagt caaaaaatct cgtgaaaaac ggagtttggc 1140 
250 ctctcatctc agtggatata tccctcccaa aaggaaacaa gggcaaggct tatctttgtg 1200 
252 tcaaaacgga gagtcaccca actgtgtgga agacaagatg ctctcgacag ttgcagtact 1260 
254 tacccttggc taa 1273 

257 <210> SEQ ID NO : 4 

258 <211> LENGTH: 1213 

259 <212> TYPE: DNA 

260 <213> ORGANISM: Human 

262 <400> SEQUENCE: 4 

263 gtgccggcgc cgcgcgaagg ttcagcaggg agccgtgggc cgggcgcgcc ggttcccggc 60 
265 acgtgtctcg gcacgtggca gcgcgcctgg ccctgggctt ggaggcgccg gcgccctgga 120 
267 tccgccggcc gtggtcgccg agtcggtgtc gtccttgacc atcgccgacg cgttcattgc 180 
269 agccggcgag agctcagctc cgaccccgcc gcgccccgcg cttcccagga ggttcatctg 240 
271 ctccttccct gactgcagcg ccaattacag caaagcctgg aagcttgacg cgcacctgtg 300 
273 caagcacacg ggggagagac catttgtttg tgactatgaa gggtgtggca aggccttcat 360 
275 cagggactac catctgagcc gccacattct gactcacaca ggagaaaagc cgtttgtttg 420 
277 tgcagccact ggctgtgatc aaaaattcaa cacaaaatca aacttgaaga aacattttga 480 
279 acgcaaacat gaaaatcaac aaaaacaata tatatgcagt tttgaagact gtaagaagac 540 
281 ctttaagaaa catcagcagc tgaaaatcca tcagtgccag cataccaatg aacctctatt 600 
283 caagtgtacc caggaaggat gtgggaaaca ctttgcatca cccagcaagc tgaaacgaca 660 
285 tgccaaggcc cacgagggct atgtatgtca aaaaggatgt tcctttgtgg caaaaacatg 720 
287 gacggaactt ctgaaacatg tgagagaaac ccataaagag gaaatactat gtgaagtatg 780 
289 ccggaaaaca tttaaacgca aagattacct taagcaacac atgaaaactc atgccccaga 840 
291 aagggatgta tgtcgctgtc caagagaagg ctgtggaaga acctatacta ctgtgtttaa 900 
293 tctccaaagc catatcctct ccttccatga ggaaagccgc ccttttgtgt gtgaacatgc 960 
295 tggctgtggc aaaacatttg caatgaaaca aagtctcact aggcatgctg ttgtacatga 1020 
297 tcctgacaag aagaaaatga agctcaaagt caaaaaatct cgtgaaaaac ggagtttggc 1080 
299 ctctcatctc agtggatata tccctcccaa aaggaaacaa gggcaaggct tatctttgtg 1140 
301 tcaaaacgga gagtcaccca actgtgtgga agacaagatg ctctcgacag ttgcagtact 1200 
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Output Set 
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303 



tacccttggc taa 



1213 



306 <210> SEQ ID NO: 5 

307 <211> LENGTH: 34 

308 <212> TYPE: DNA 

309 <213> ORGANISM: Human 

311 <400> SEQUENCE: 5 

312 cggggtacca aaaatgcgca gcagcggcgc cgac 34 

315 <210> SEQ ID NO: 6 

316 <211> LENGTH: 21 

317 <212> TYPE: DNA 

318 <213> ORGANISM: Human 

320 <400> SEQUENCE: 6 

321 tccttccctg actgcagcgc c 21 

324 <210> SEQ ID NO: 7 

325 <211> LENGTH: 20 

326 <212> TYPE: DNA 

327 <213> ORGANISM: Human 

329 <400> SEQUENCE: 7 

330 tgcacaggtg cgcgtcaagc 20 

333 <210> SEQ ID NO: 8 

334 <211> LENGTH: 20 

335 <212> TYPE : DNA 

336 <213> ORGANISM: Human 

338 <400> SEQUENCE: 8 

339 cacaaacaaa tggtctctcc 20 

342 <210> SEQ ID NO: 9 

343 <211> LENGTH: 30 

344 <212> TYPE: DNA 

345 <213> ORGANISM: Human 

347 <400> SEQUENCE: 9 

348 cggtctagat tagccaaggg taagtactgc 30 

351 <210> SEQ ID NO: 10 

352 <211> LENGTH: 30 

353 <212> TYPE: DNA 

354 <213> ORGANISM: Human 

356 <400> SEQUENCE: 10 

357 cctcccgggg ccaagggtaa gtactgcaac 30 
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